Goto

Collaborating Authors

 Sussex County


Cross Domain Early Crop Mapping using CropGAN and CNN Classifier

Wang, Yiqun, Huang, Hui, State, Radu

arXiv.org Artificial Intelligence

Driven by abundant satellite imagery, machine learning-based approaches have recently been promoted to generate high-resolution crop cultivation maps to support many agricultural applications. One of the major challenges faced by these approaches is the limited availability of ground truth labels. In the absence of ground truth, existing work usually adopts the "direct transfer strategy" that trains a classifier using historical labels collected from other regions and then applies the trained model to the target region. Unfortunately, the spectral features of crops exhibit inter-region and inter-annual variability due to changes in soil composition, climate conditions, and crop progress, the resultant models perform poorly on new and unseen regions or years. This paper presents the Crop Generative Adversarial Network (CropGAN) to address the above cross-domain issue. Our approach does not need labels from the target domain. Instead, it learns a mapping function to transform the spectral features of the target domain to the source domain (with labels) while preserving their local structure. The classifier trained by the source domain data can be directly applied to the transformed data to produce high-accuracy early crop maps of the target domain. Comprehensive experiments across various regions and years demonstrate the benefits and effectiveness of the proposed approach. Compared with the widely adopted direct transfer strategy, the F1 score after applying the proposed CropGAN is improved by 13.13% - 50.98%


The Buzz Behind an App That Can Monitor Beehives Remotely

WIRED

You've probably heard by now that bees are dying in record numbers. They're being poisoned by pesticides while urbanization encroaches on bees' natural habitats, leaving them with fewer places to live and fewer wildflowers to feed on, says Harvard biologist James Crall, who studies bumblebees. The die-off comes as the world's human population is expected to grow from 7 billion in 2010 to 9.8 billion in 2050; as incomes rise, food producers will need to supply 56 percent more calories to meet growing demand, according to a December report by the World Resource Institute. That's going to be hard to do without the wild bees farmers have traditionally relied on to pollinate their crops. "An enormous amount of our food crops depend on animal pollinators," Crall says, highlighting fruits, nuts, and berries.


Rapid Adaptation of POS Tagging for Domain Specific Uses

Miller, John E., Bloodgood, Michael, Torii, Manabu, Vijay-Shanker, K.

arXiv.org Machine Learning

Part-of-speech (POS) tagging is a fundamental component for performing natural language tasks such as parsing, information extraction, and question answering. When POS taggers are trained in one domain and applied in significantly different domains, their performance can degrade dramatically. We present a methodology for rapid adaptation of POS taggers to new domains. Our technique is unsupervised in that a manually annotated corpus for the new domain is not necessary. We use suffix information gathered from large amounts of raw text as well as orthographic information to increase the lexical coverage. We present an experiment in the Biological domain where our POS tagger achieves results comparable to POS taggers specifically trained to this domain. Many machine-learning and statistical techniques employed for POS tagging train a model on an annotated corpus, such as the Penn Treebank (Marcus et al, 1993). Most state-of-the-art POS taggers use two main sources of information: 1) Information about neighboring tags, and 2) Information about the word itself. Methods using both sources of information for tagging are: Hidden Markov Modeling, Maximum Entropy modeling, and Transformation Based Learning (Brill, 1995).